Scaling in the Inter- Event Time of Random and 
Seasonal Systems 



Cesar A. Hidalgo R. 
chidalgo@nd.edu 



1 Center for Complex Network Research and Department of Physics, 
University of Notre Dame, Notre Dame, In. 46556 
2 Helen Kellogg Institute, Notre Dame, In, 46556 

Abstract 

Interevent times have been studied across various disciplines in search for 
correlations. In this paper we show analytical and numerical evidence that at 
the population level a power-law can be obtained by assuming poissonian 
agents with different characteristic times, and at the individual level by 
assuming poissonian agents that change the rates at which they perform an 
event in a random or deterministic fashion. The range in which we expect to 
see this behavior and the possible deviations from it are studied by considering 
the shape of the rate distribution. 
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Power-law scaling is often considered a sign of complexity. The independence 
of scale exhibited in this type of systems have fascinated many scientists that 
have attempted to explain the dynamics and correlations giving rise to this 
statistical property in different systems, such as complex networks Ej> 
fractals 4 and economic fluctuations |2| (for a complete review of power-laws 
in nature and possible mechanisms that produce them we encourage the reader 
to see [£>:). This type of function also appears in allometric laws of ecology j3IHl 
and in the distribution of interevent times of several different systems. In this 
last context, power-law scaling has been found in the stock exchange |9l IIP), 
earthquakes [111 IT2] email login times ^3|> print job submissions ^3], email 
replies |T2], regular mail TB| and browsing patterns ^7]. In all of these systems 
the distribution of interevent times scales as t~ q though the exponents tend 
to vary from system to system. Some of these exponents tend to be close to 
a = 1 , another class is close to a = 3/2 while a last class tends to be around 
a = 2. The first class belongs to systems governed by human decisions. Here 
it has been proposed, as a very likely candidate, a model based on priority 
queues, which captures this precise exponent ■ Whereas the second class 



of behavior has been observed in the response times of Einstein's and Darwin's 
corrependence [TB| . Finally, the third class of behavior has been observed in 
earthquakes and the stock exchange |§1 11UI ITT1 112) . 

In the past exponentials have been used to explain power-laws [B]. One of the 
mechanism used is to consider a variable that has an exponential distribution 

f(y) ~ e °» (1) 

and look for a variable that is related to the first one through an exponential, 
such as 

x ~ e by . (2) 
Then, the distribution of x is given by 

/(*) = M% = l~/ a/b) ^ x) = (3) 

This argument was first introduced by Miller |191 12U] to explain the power law 
distribution of words in texts. Here we study another simple way to extract a 
power-law, in this case from a fluctuating or time dependent Poisson process. In 
the latter part of this work we show that seasonal behavior can also conduct to 
power-laws in the distribution of interevent times. Seasonality is a phenomenon 
that becomes manifest in a variety of systems and the change of behavior induced 
by it can be sufficient to abandon the poisson paradigm. 

Here we are concerned with a particular example, the distribution of in- 
terevent, or waiting times. We argue that the a = 2 exponent should be ex- 
pected when we consider the distribution of interevent times in a population 
made of several regular components which are individually different, or when 
we consider individual components of heterogeneous behavior. For the first case, 
we consider that the rate that an agent performs an event in a certain time in- 
terval is given by p and the population of agents is such that we can define f(p) 
as the distribution of agents with particular p's. 

For the sake of clarity we assume to have a population of agents which send 
e-mails at a certain rate pWe also assume that the populations is big enough to 
define a distribution of rates given by f(p). We now perform a measurement in 
which we ask each agent for the time elapsed between its last two e-mails and 
make the histogram of this poll. The simplest case is the one in which everybody 
is the same and f(p) = 8{p — po), where 5 is Dirac's delta distribution. In this 
case, the global behavior matches the individual one. Thus the interevent time 
decays exponentially with a mean given by I /p. 

If we consider agents that have a stable individual behavior, but as a popu- 
lation, have a broad distribution of rates, we would be in a situation in which 
the individual behavior does not match the global one. Individually, the agents 
will send emails in a Poisson fashion allowing us to approximate their interevent 
times by their personal average, which is well defined and representative at the 
individual level. At the population level, we need to find the distribution of 
interevent times. We can do this by simply calculating the fraction of agents 
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Figure 1: a. Finite size scaling for the distribution of interevent times ob- 
tained when the individual probabilities of agents was taken from a uniform 
distribution. The straight line has slope -2. The dash dotted lines shows 
the maximum interevent time registered for a particular number of iterations. 
These and all subsequent plots were made with log-binning, b. The same re- 
sult is obtained when we consider an exponential distribution of rates given 
by f(p) — (1/8) exp (— 8p). c. When a normal distribution is chosen for /(p), 
the power-law behavior is present when it is wide (N~ [ =0.4, =0.2]) and d. 
disappears when it is narrow (N~ [ =0.4, =0.01]). 



that took less than r time units between two consecutive emails 

P(T < r) = P(l/p < r) = P{p > 1/t) = / f(x)dx, (4) 

Ji/t 

and then differentiate this expression to get the probability density 

P(T < t) = F(oo) - F(l/r) - P(T = r) = = /(1/r)^ (5) 

which scales as 1 /r 2 and has an envelope given by the original function evaluated 
at 1/t. 

Numerically, we can simulate this situation by considering a distribution f(p) 
and a sufficiently large population. We can do this by picking up a particular 
agent with a certain p from the distribution f(p) and simulate the process until 
it send an email by asking at each time step if he is sending one or not. Fig. |T| 
shows our prediction for 3 different rate distributions. In the case of a uniform 
distribution (Fig. LU a ), we have a system that behaves clearly as a power law 
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and has no envelope. We have also performed simulations with an increasing 
number of agents to show that finite size scaling defines a clear region in which 
this behavior is present. From equation © we have that when 

f( P )~U{0,L}^P(T = T) = l/T 2 . (6) 

and for an exponential distribution of rates we have 

P - a / r 

P(T = r) = — 5-. (7) 

In this case when r — * oo, e~ a/>T — > 1 and the behavior is the same as for the 
uniform distribution which can be correctly recovered when a — > 0. 

Deviations in the exponent can be found when the distribution of probabili- 
ties satisfies a power law f(p) ~ pP . Using equation (JjjJ we can find that in this 
case 

P(T = r) = t'^+V (8) 

which represents a deviation of the a — 2 exponent which occurs in the case we 
inject a power-law to the system. 

The studied cases do not introduce any cut-offs for large r. This comes from 
the fact that long interevent times come from small rates. All of the distribu- 
tions presented above have support close to zero, so in principle times can be 
infinitely long. Cut-offs in the distribution can be introduced by restricting the 
support close to zero. A simple example of this is considering the case in which 
the support of f(p) is restricted to the [p<,p>]. According to the formalism 
presented in equations Q and (JjJ , this introduces a hard cutoff at T max = 1 jp < 
for large t and at r m i„ = l/p> for small r. Thus we can say that as a rule of 
thumb 

T max ~ l/p< where p< = min[supp[f (p)}] (9) 

Tmin ~ 1/.P> where p> = max[supp[f (p)}] (10) 

We can refine this argument for T max by considering that f(p) decays in a 
smooth way as we approach the left edge of its support. Approximating f(p) 
by a power series. 

oo 

f{p) = Y J A kP \ (11) 

k=0 

and using this in equation (jSJ) we can conclude that for large r the distribution 
of interevent times follows 

"Cr-rj-fflE^. (12, 

When t — ► oo El scales as r k +2 where k' is the coefficient of the lowest order 
non- vanishing expansion coefficient. To be more precise the k' exponent domi- 
nates the distribution when r satisfies r > ((k + l)^4fe/Afe/) 1 / fc for every k. To 
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simplify this discussion, we can say that when /(p) decays linearly towards zero 
a = 3, and when f(p) decays as a parabola we have a — 4. The a — 2 exponent 
for large times is an indication that f(p) can be approximated by a constant 
close to the left edge of its support. 

The analytical results presented above were tested via numerical simulations. 
Figure (^b) confirms that for an exponential distribution of probabilities the 
scaling behavior is still clearly visible and extends through several decades. 
We also considered the case of a normal distribution. In this case the scaling 
behavior appears when the distribution is wide enough (Fig. lc) and disappears 
for narrow bells (Fig. Id) which have negligible support close to zero. 

So far, we have shown that we can expect a power-law for the distribution 
of interevent times whenever we ask heterogeneous individuals for the time 
between its last two events and poll that data together. We have argued that 
the exponent should be a — 2 when the distribution of rates is broad enough 
and it can be extended for several decades when f(p) has support close to zero, 
we have also shown that deviations from this exponent can be explained by 
consideration rate distributions that scale as a power of the rate. This argument 
can be extended even further to include a population in which we do not only 
ask agents to tell us the interevent time between their last two events, but we 
have the distribution of events for each one of them. We are concerned with the 
neutral case in which agents are not correlated in time or across the population, 
and therefore they individually follow exponential distributions. If we poll this 
data together by adding up all these distributions, we also expect a power law 
decay with a, a = 2 exponent. This can be seen clearly by adding up normalized 
exponentials representing the interevent time distribution of regular individual 
agents 

n 

P(T = r) = J2Pi^P(-Pir), (13) 

i 

and assuming a uniform distribution of rates and a large enough population we 
get 1 , 

f°° d f°° 

P(T = t)= / pexp (—pr)dp = — — / exp (— rp)dp, (14) 
Jo dr J 

which can be easily solved resulting in 
d exp (— rp) 00 



dr 



< 15 > 



This argument can be easily generalized to include any distribution f(p). In 
this case we have 

poo 

P{T = t) = / f(p)pe-P T dp (16) 
Jo 

which may sound redundant given equations Q and J5|- In fact, we introduce 
both methodos because the first one is easier to use in some cases in which 



1 This method was introduced in 1131 as a possible explanation for the interevent times. 
Although only the case with a uniform distribution of probabilities was considered. 
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Figure 2: a. Periodic behavior of the rates used to model seasonal events b. 
Interevent times obtained for a process modeled using the rates in a. c. The 
distribution of interevent time mimics a power-law with an exponent close to 
—2 (straight line has slope —2). d. The same as a. except that in this case 
smaller rates are active for longer times, e. Interevent times, f. For this case 
the distribution of inter event times mimics a power-law with an exponent close 
to -1. 



the integral given in equation (|16fl is not trivial to calculate. The conceptual 
difference of the two methods for calculating interevent time consists that in the 
first one we assume that the times coincide precisely with their expected values 
in the exponential distributions (l/f>) whereas in the second one we consider all 
possible values with associated with a given probability. In the case that f(p) 
is an exponential distribution normalized in the [0, 1] interval we have that 



1 

(a ~rf 



P{ ? = t) = j—^. (17) 



whereas when f(j>) is a power-law this second method requires us to solve 

P(T = t)= ( p^e'^dp (18) 
Jo 



which can be expressed in terms of a T function as 

P(T = t) = (- T(/3 + 2) (19) 



being hardly more useful than equation (jSJ). 

The arguments used so far have been used to show that at a population level 
we are likely to observe a power-law distribution of interevent times when we 
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poll up a large population of non-identical users. So far we have assumed that 
single users perform events at a fixed rate and act accordingly. But what can 
we expect if we allow individual agents to vary? Going back to our email user 
analogy, we can imagine that we measure the time intervals between several 
consecutive emails for a particular agents and discover that the histogram of 
interevent times is a power law with an exponent a — 2. We can show that in 
this case, an uncorrelated random process can also explain the scaling exponent. 
For this matters, let us consider an agent that initially send emails at a rate po. 
After sending an email, we record the interevent time and start waiting again. If 
Pa is fixed, the interevent time will decay exponentially, but if we allow this rate 
to change in time this would not be necessarily true. The simplest case would 
be to let po evolve in a purely random fashion, in other words, after the agent 
sends an email, we randomly draw a new rate pi from the [p<,p>] interval. If 
this is the case, this system would be the same that the one in which we consider 
several agents with different rates, and therefore, it is again obvious to expect 
an a = 2 exponent in the interevent time distribution. Thus, an agent that 
varies its behavior in a random way maps the same model that a population 
of significantly different users. The way in which an individual agent varies its 
behavior is actually not important, as long as different rates are chosen. In 
fact, we can relax the assumption that rates vary randomly and instead choose 
a periodic function for them. In figure (J2J we show an example of this process 
in which we simulate a system in which the event execution rate changes from 
0.2 to 0.02 to 0.002 to then be reseted back to its original value of 0.2 to start 
all over again. Here the rate depends only on time and does not change after 
an event is registered (figure El a). The interevent times are therefore correlated 
and the system changes periodically from activity to inactivity (figure[5]b). The 
distribution of interevent times mimics a power-law with a = 2, but upon closer 
look, one can identify three humps which coincide with the expected times of the 
three fixed probabilities involved in the process. In order to mimic a power-law 
with seasonality one needs to consider p(t) such that the values taken by this 
function are widely distributed 2 and that the function has regions in which it 
varies slowly enough (or not at all) to allow interevent times to be consistent with 
the rates proper corresponding to each time. 3 Figures (J2J a,b and c; correspond 
to a case in which p(t) mimics an exponential distribution, this is because after 
a linear time it decays an order of magnitude. We can shift the exponent in 
this case by working with a p(t) that mimics a power law. As an example we 
show the case in which we consider the same three probabilities as before (0.2, 
0.02 and 0.002) but instead of lasting the same amount of time each, we make 
them last 100, 1000 and 10000 time steps respectively. In this case the longer 
interevent times are as frequent as the shorter ones and the system mimics a 
power law with a = 1 (Figure (J2J f). 

2 we mean widely in a logarithmic sense. A similar number of values per order of magnitude. 

3 An exaggeration of this is to consider a two step function with values 0.1 and 0.01 that has 
a period of two time steps. In this case it is obvious that the system is going to be dominated 
by p = 0.1 because the period of the function is shorter than the expected time associated 
with p = 0.01. 
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In the light of the previous results and examples we have shown cases in 
which we can obtain power-laws by combining exponentials. The cutoff of the 
power-law depends on the minimum of the support of the distribution f(p) 
and the exponent for large r depends on the leading term of the power series 
expansion of /(p). In the case that the function approaches the left edge of its 
support as a constant we expect the exponent for large r to be a = 2, but in the 
case that the function approaches this edge as a function with singular support, 
we have a decrease on the exponent which is equal to the order of the singularity. 
Finally when the exponent approaches the edge as a polynomial a increases by 
the degree of the of the k' term of the polynomial for r > ((k + X)Akj Ay ) 1 / fc Vfc. 

In the case of large earthquakes, it has been shown that the interevent time 
of earthquakes larger than a given magnitude scales as a power-law with a 
a = 2 exponent jllj . Omori's law is not valid for the interevent time between 
earthquakes larger than a certain magnitude, which is the case in which you see 
this exponent. This is because Omori's law deals with the aftershocks which are 
clearly correlated. It was also argued that the a = 2 exponent indicates a time 
correlated behavior, because the interevent time distribution is not Poissonian. 
From the analytical arguments shown above, we can see that the a = 2 exponent 
is precisely what is expected for the uncorrelated case in which we consider that 
an earthquake occurs with a probability that is randomly reset after each event. 
In the case of the stock exchange ^U]; it was shown that the scaling exponent 
tends to decrease as the threshold on the normalized fluctuations increases. In 
other words, when waiting times between large fluctuations are considered, the 
scaling exponent approaches a = 1 indicating possibly a different mechanism 
like a queue model|2*T] or a power-law injection, whereas when small variations 
are considered the exponent is close to a — 2, which could be a signature of 
seasonality [5] or uncorrelated probability variations. 

Despite the simplicity of our calculations, Poisson process are usually as- 
sumed in stochastic modeling. Here we have shown that when we have a broad 
population which participates in individual poisson process or agents which have 
non-stationary behavior the waiting time distributions follows a power-law. In 
the case of a simple poisson processes the first thing that should be considered 
is that usually not all of the components of the system behave in the same way, 
this consideration alone is enough to define a scaling region that has not usually 
been considered. 
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NSF ITR 0426737, NSF ACT/SGER 0441089 and the James S. McDonnell 
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